Granularity in Structured Documents

Author

  • Frans C. Heeman

Abstract

Structured documents have become a widely accepted concept for document manipulation applications such as editing, formatting, and archiving. However, some aspects of structured documents are still not well understood. In particular, the transition from logical structure to contents is a grey area in which different systems use different interpretations. In this article, we discuss this granularity aspect of structured documents. We focus on the underlying concepts of structured documents without referring to any particular application, so that the discussion is kept clear of aspects that are not related to structured documents.

Similar resources

Hierarchical Indexing and Flexible Element Retrieval for Structured Document

As more and more structured documents, such as SGML or XML documents, become available on the Web, there is a growing demand to develop effective structured document retrieval which exploits both the content and the hierarchical structure of documents and returns document elements with appropriate granularity. Previous work on partial retrieval of structured documents has limited applications due to the r...

Translating Structured Documents

Machine Translation traditionally treats documents as sets of independent sentences. In many genres, however, documents are highly structured, and their structure contains information that can be used to improve translation quality. We present a preliminary approach to document translation that uses structural features to modify the behaviour of a language model, at sentence-level granularity. ...

Applying the IRstream Retrieval Engine for Structured Documents to INEX

For a long period of time the research activities in information retrieval have mainly addressed flat text files. Although there have been approaches towards multimedia data and structured data in the past, these topics gain increasing interest today in the context of XML data. To address structured multimedia data, an efficient combination of content-based retrieval for multimedia data, retriev...

Improving Results for Focused and Relevance-in-Context Tasks

This is to certify that I have examined this copy of a master's thesis by Salil G. Bapat and have found that it is complete and satisfactory in all respects, and that any and all revisions required by the final examining committee have been made. Acknowledgements I would like to take this opportunity to thank several people who have contributed towards the successful completion of this thesis. ...

Dependence of Binary Associations on Co-occurrence Granularity in News Documents

We describe and formalize an approach to correlate binary associations (such as between entities and events, between persons and events, etc.) implied by News documents on the co-occurrence granularity (such as document-level, paragraph-level, sentence-level, etc.) of the corresponding text phrases in the documents. Specifically, we present both qualitative and quantitative characterization of ...

Journal title:
  • Electronic Publishing

Volume 5  Issue 

Pages  -

Publication date 1992